Temporal difference learning

Results: 95



#Item
21Sutton, Richard  PIN

Sutton, Richard PIN

Add to Reading List

Source URL: incompleteideas.net

Language: English - Date: 2013-10-18 16:05:54
22fourteen declarative principles of experience-oriented intelligence 1. all goals and purposes can be well thought of as the maximization of the expected value of the cumulative sum of a single externally received number

fourteen declarative principles of experience-oriented intelligence 1. all goals and purposes can be well thought of as the maximization of the expected value of the cumulative sum of a single externally received number

Add to Reading List

Source URL: incompleteideas.net

Language: English - Date: 2009-03-27 16:18:08
23Gaussian Process Temporal Difference Learning - Theory and Practice Yaakov Engel Collaborators:  Shie Mannor, Ron Meir, Peter Szabo,

Gaussian Process Temporal Difference Learning - Theory and Practice Yaakov Engel Collaborators: Shie Mannor, Ron Meir, Peter Szabo,

Add to Reading List

Source URL: www.grappa.univ-lille3.fr

Language: English - Date: 2006-07-12 03:07:08
    24Emphatic Temporal-Difference Learning A. Rupam Mahmood Huizhen Yu  Martha White

    Emphatic Temporal-Difference Learning A. Rupam Mahmood Huizhen Yu Martha White

    Add to Reading List

    Source URL: ewrl.files.wordpress.com

    Language: English - Date: 2015-06-22 05:16:35
      25research-article2013 PSRXXX10.1177<italic>Personality and Social Psychology Review</italic>Cushman  Article

      research-article2013 PSRXXX10.1177Personality and Social Psychology ReviewCushman Article

      Add to Reading List

      Source URL: cushmanlab.fas.harvard.edu

      Language: English - Date: 2014-08-18 11:16:16
      26Reinforcement Learning for Declarative Optimization-Based Drama Management Mark J. Nelson, David L. Roberts, Charles L. Isbell, Jr., Michael Mateas College of Computing Georgia Institute of Technology Atlanta, Georgia, U

      Reinforcement Learning for Declarative Optimization-Based Drama Management Mark J. Nelson, David L. Roberts, Charles L. Isbell, Jr., Michael Mateas College of Computing Georgia Institute of Technology Atlanta, Georgia, U

      Add to Reading List

      Source URL: www.kmjn.org

      Language: English - Date: 2015-05-16 18:38:08
      27Lenient Frequency Adjusted Q-learning Daan Bloembergen Michael Kaisers  Karl Tuyls

      Lenient Frequency Adjusted Q-learning Daan Bloembergen Michael Kaisers Karl Tuyls

      Add to Reading List

      Source URL: michaelkaisers.com

      Language: English - Date: 2012-04-29 08:03:24
      28Playing Atari with Deep Reinforcement Learning  Volodymyr Mnih Koray Kavukcuoglu

      Playing Atari with Deep Reinforcement Learning Volodymyr Mnih Koray Kavukcuoglu

      Add to Reading List

      Source URL: www.cs.toronto.edu

      Language: English - Date: 2013-12-19 11:19:32
      29Balancing Anarchy and Central Control Individual vs. Joint Action Reinforcement Learning Daniel Claes June 18, 2010  Abstract

      Balancing Anarchy and Central Control Individual vs. Joint Action Reinforcement Learning Daniel Claes June 18, 2010 Abstract

      Add to Reading List

      Source URL: michaelkaisers.com

      Language: English - Date: 2012-04-29 08:03:28
      30Coevolution of a Backgammon Player Jordan B. Pollack & Alan D. Blair Computer Science Department Volen Center for Complex Systems Brandeis University Waltham, MA 02254

      Coevolution of a Backgammon Player Jordan B. Pollack & Alan D. Blair Computer Science Department Volen Center for Complex Systems Brandeis University Waltham, MA 02254

      Add to Reading List

      Source URL: www.demo.cs.brandeis.edu

      Language: English - Date: 1997-03-13 13:43:33